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INTRODUCTION 



The purpose of this Application Note is to acquaint the reader with some of the many facets of computer 
design. Beginning with a statement of purpose, we will consider the specification of a computer from a func- 
tional standpoint. Presentation of a modular architecture will then help the designer to understand the flow of 
data and instructions through the system and to help resolve the inevitable problems of resource contention 
(the desire to use a bus or functional subsystem for more than one purpose at a time). From a management 
perspective, early definition of the architecture modules allows a number of people to work on the design 
of different controlled subsystems concurrently. 

The use, or intended use of a microcomputer will be illuminated by its instruction set, the form or forms of its 
instructions, and the form and format of its data words. The computer hardware, then, must satisfy all of the 
functional requirements of the machine instructions. (In a stored-program computer, the "machine instruc- 
tions" are those system control statements that reside in the main memory and are fetched and executed by 
the computer control unit.) 

Having defined the instruction set, the various controlled subsystem components may be designed. The state 
transition diagrams may then be specified as can the control signal requirements of the system controller. 



MICROPROGRAMMED MACHINES 



The Original Equipment Manufacturer, OEM, has a number of goals that he would like to satisfy in the design 
of any new product. He must first satisfy his current requirements; however, he would like to build into his 
equipment the ability to expand the machine, increase its speed or thru-put, and allow for additional functions 
and modernization. Secondly, very few machines are ever designed without some flaw that escapes the manu- 
facturer's attention until it gets into the hands of the users in the field; the ability to change the machine 
without extensive engineering change orders is paramount. A machine that is dedicated to a specific task or a 
class of tasks, rather than a general purpose processor is becoming the rule rather than the exception as hard- 
ware costs become less important than software costs. It is very desirable, then, to be able to build a processor 
that is capable of executing very complex tasks without a large stored program; perhaps, a single macro- 
instruction will suffice. The OEM may have a previous generation of hardware that was implemented with 
random logic in the form of MSI and SSI elements. Because of a large investment in software, it would be 
desirable to implement the same instruction set using newer technology with an emulation in hardware of the 
previous machine. 

A microprogrammed processor will allow the manufacturer the flexibility to satisfy any or all of those 
requirements. Rather than trying to define a microprogrammed machine in an elegant, abstract way, the 
concepts of the microprogram will be demonstrated by example. 
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CHAPTER I 

ENGINEERING MANAGEMENT 



From an engineering management standpoint it is most im- 
portant that the following events occur in the order presented: 
production of a written specification, an architectural design, 
derivation of a management schedule based on available 
resources, the implementation of the various architectural 
blocks, system integration, an engineering review, redesign, 
and documentation. A management review should take place 
at each of these milestones before the next phase of the 
development is started. 

Specification 

It is extremely important that a written specification for the 
system is completed and agreed upon by all interested parties 
before any other work is initiated. Otherwise, the program is 
bound to be extremely inefficient in manpower, will probably 
require much more redesign activity, and may be very demor- 
alizing to the technical staff. Further, there will be no way to 
measure progress. 

In the definition of a computer, the specification may be 
delineated by at least the following sections: Design Goals, 
Formats, Instruction Requirements, Addressing Methods, 
Input-Output, Memory, and Control Panel. The form of the 
specification may change from company -to -company, or 
even from design -to -design within one company, but the 
intent of the specification should remain the same. The 
purpose of the specification is to provide the philosophy 
behind the product, a complete functional description, and 
any important limitations that are to be imposed. This is a 
reference document for the design, marketing, production, 
and management staffs. 

Architecture 

The architectural design will provide a description of each 
of the controlled functions in the machine, each of the buses 
in the system and the order of functional module contention 
resolution, if any, and the system controller. 

Each of the controlled functions, e.g., the Arithmetic Logic 
Unit, should be described functionally, and each storage device 
within the functional block should be described. The Computer 
Control Unit should be presented in a similar manner. 

The buses, and the approach to bus control that will be imple- 
mented should be described from a functional viewpoint. 

Management Schedule 

Having defined the architectural blocks of the system, it is 
then possible to start the design of the system. While the 
controlled elements of the system are being conceptualized the 
Computer Control Unit and the memory system timing 
philosophies should be worked on in detail. Ultimately, this 
portion of the design will provide the speed limitation on 
the system. Considering the selected architecture, each class 
of instruction should be flow-charted as a general case and a 
state-transition graph for the processor should be derived. 
Each machine instruction to be implemented should then be 
examined carefully to illuminate the demands placed on each 
subsystem; these demands should be tabulated by subsystem 



so that the designer of each subsystem understands what his 
portion of the system must be capable of doing. This point 
in the development cycle is an excellent time for a program 
review to determine whether the design goals can be met with 
the approach that has been initiated. 

The implementation of the various architectural blocks should 
then be scheduled. These tasks are very straight-forward and 
should provide no surprises if the initial conceptual work was 
done carefully. Each subsystem should be completely tested 
by itself before any attempt is made to connect the subsystems 
together. 

After completing the subsystem tests, the subsystems should 
be integrated (connected together using common buses, clocks, 
and control signals). From a management standpoint, system 
integration is very inefficient. Except for a very large system 
only one or two engineers can work on the complete system 
tests. Generally, the engineer who designed the Computer 
Control Unit and one other senior engineer should be assigned 
to this task. This leaves the remainder of the project staff 
idling without any direct contribution to the system design 
and test effort. This is an excellent time to start the documen- 
tation effort and liaison with the various production groups. 

The engineering review really consists of two separate reviews 
separated in time. The first review is internal to the engineering 
department and consists of design trade-offs. The second review 
is a company-wide, management level review which should 
consider the effects of the design on incoming inspection and 
Q.A., purchasing, automatic test, mechanical and electrical 
assembly, final test, marketing, sales, engineering support, field 
service, etc. The technically oriented business decisions should 
also be addressed during the second meeting. The level and 
cost of inventory, the manufacturing cost of the product, the 
marketing related support costs, reliability and warranty expo- 
sure, spare parts inventory, time to repair and field service 
costs, cost of operating and maintenance supplies, installation 
time and cost, environmental constraints, size, power con- 
sumption (can it be installed without having to rewire the 
installation facility), all of the human factors and esthetic 
considerations, packaging, effect on installed air conditioning 
and/or requirements for special cooling, "brownout" protec- 
tion, UL, CSA, IEC, etc., liability. From the corporate view- 
point, is the current staff, facility, and equipment capable of 
supporting manufacturing and selling of the product? Quite 
often the business considerations significantly outweigh the 
technical considerations. It is an exceptional engineering 
department, or a poor management team, that is capable of 
passing this second engineering review. For this reason it is 
important that a redesign period be scheduled into the pro- 
gram plan. 

How much time should be scheduled for the redesign? This 
is very hard to judge and will change from company to com- 
pany, but the time for redesign is usually controlled directly 
by the longest lead-time requirement for any non-design 
related task or product. For example, packaging components 
typically have longer lead-times associated with them than do 
standard IC's. In any case the engineering redesign time 
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estimate, and the longest lead-time estimate, and the engi- 
neering reimplementation time, and a less stringent engineering 
review time might be summed together to provide a conser- 
vative redesign task. 

Good quality, complete documentation may mean the dif- 
ference between financial success and failure. The documen- 
tation effort should continue with the direct support of the 
original engineering team. The product build-up, "explosion," 
starting with the highest level assembly and terminating with 



the lowest level assembly is important because it will provide 
purchasing order quantities that they may purchase against a 
forecast demand. This documentation should also include an 
approved vendor list of each component. 

Schematics, mechanical layouts, cabling diagrams, and any 
other information necessary to produce, test, trouble-shoot, 
purchase components, etc., must also be supplied during this 
phase of the development. A mechanism for instituting engi- 
neering change orders must also be provided. 
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CHAPTER II 

SPECIFICATION OF THE MACHINE 



Because this application note is being written for a broad 
spectrum of users and users' requirements, this specification 
and the resulting hardware design may not be the "best" 
implementation of a modern processor. There are a number of 
excursions from what might be considered an optimum design 
to illustrate technique or design trade-offs. 

DESIGN GOALS 

The intention of this design is to provide a third generation 
computer with a general-purpose instruction set that is opti- 
mized for high-speed input-output operations. 



Function 

A typical functional specification is presented here for the 
purpose of presenting an application requirement. 

This processor shall be capable of interfacing to 12-bit, 2's 
complement analog-to-digital and digital-to-analog converters 
with 500ns conversion times. Further, the ability to interface 
with dual synchronous or asynchronous communications lines 
operating up to 50 kilo-baud per channel and two parallel, 
full word-length channels directly connected to a host com- 
puter. (These interface designs are not part of this application 
note.) 

The initial design must provide for 2's complement add and 
subtract, the basic logic functions, indexed data movement 
instructions and a set of executive level I/O instructions that 
will provide for minimum service or interrupt request overhead. 
The computer must allow for expansion of its basic instruction 
set and for the addition, in microcode, of statistical and 
communications protocol algorithms. 

At least one channel of direct memory access, DMA, and 
16-levels of vectored priority interrupt are required. The 
processor must be capable of acknowledging and processing 
an interrupt without polling, calculating, or otherwise deter- 
mining which channel caused the interrupt. The DMA function 
must be slaved to the interrupt controller. That is, when an 
interrupt request occurs either a vectored interrupt or a direct 
memory access is initiated. 

The input-output channel interface must be very simple. All 
service contention (requests for input-output service generated 
by multiple data channels at the same time) will be resolved in 
the interrupt controller. 



Speed 

The main memory full cycle time must be less than or equal 
to the 500ns data conversion times includingaddress decoding, 
data source and destination control and processor overhead 
functions. 

The processor cycle time should be the minimum necessary to 
service the various controlled functions without variable or 
asynchronous clock functions. All of the central processor 
subsystems will be synchronous, but the memory and input- 
output controllers will be required to furnish a ready signal 



after each attempt to read or write has been satisfied. This 
will permit memory and peripheral devices of different speeds 
to be attached to the processor. 

Electrical 

All of the circuit elements must run off a single five volt power 
supply only; current requirements will be determined during 
the design effort. Other voltages will be supplied for control of 
peripheral devices only. 

Mechanical 

A 9x12 inch printed circuit board format will be used. Two 
2x36 edge connectors will be used on the bottom edge of the 
card (along the major dimension). 

Approximately 78 sixteen-pin integrated circuits, or an equiva- 
lent number of larger packages, may be mounted on a single, 
two-layer printed circuit board if cables are connected to the 
top of the card. Otherwise, about 96 sixteen-pin packages, or 
equivalent, may be installed on one board. This circuit density 
is around one 16-pin package per square inch. 

If a four-layer printed circuit board is used, with the two 
buried layers being power and ground distribution, on board 
decoupling and filter capacitors may be removed and the 
circuit density may be increased to one 16-pin package per 
0.75 square inches. This yields a package count of 101 per 
board or 128 per board depending on whether or not cables 
are connected to the top of the board. 

Cooling will be convective or forced-air depending upon the 
power density on the boards. Printed circuit board spacing will 
also depend on the power density and the ambient, intake air 
supply temperature. 

Technology 

Due to the speed requirement and the single 5V power supply 
availability, a bipolar integrated circuit technology must be 
used for the digital logic. To reduce the current requirement, 
low-power Schottky devices will be used where possible. 
Similarly, low-power, high-speed memory devices that are 
designed in static, n-channel MOS technology should be 
utilized for main memory. 



MAIN MEMORY FORMATS 

Main memory will contain instruction and data information. 
To maximize the data transfer role and the instruction effi- 
ciency a main memory word size of 16-bits is specified. No 
parity bit will be used. The bits will be numbered from Oto 15. 

Data Format 

The main memory arithmetic data format will be integer, 2's 
complement. The least significant bit will be and the most 
significant bit 14. Bit 15 will be the sign bit. With the excep- 
tion of the sign bit, the binary weight of any bit position will 
be 2, raised to the power of the bit position. The sign bit has 
the weight -2 15 . 
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Logical data formats will include 16-bit words and 8-bit bytes. 
Two 8-bit bytes may be stored in a single main memory word. 
The lower byte is contained in memory bit positions to 7, 
and the upper byte is contained in bit positions 8 to 15. 

Instruction Format 

Machine instructions will consist of one or two main memory 
words. (The general format will be similar to an IBM 360 
instruction.) There will be two single word instruction formats 
and two double word instruction formats: Register-to-Register 
(RR), Special Format (SF), Register Indexed (RX), and 
Register Immediate (Rl). 

The operation to be performed by the current instruction will 
reside in the upper byte of the memory word, bits 8 through 
15, providing for 256 different instructions. The remainder of 
the first word, and the second word if it exists, will be used for 
operand(s). (Operands provide data, addresses, or data source 
and destination information relating to the current instruction.) 



INSTRUCTION REQUIREMENTS 

Machine instructions may be broken down into a number of 
general functional classes: arithmetic, logic, shift, data move- 
ment, program control (branch), input-output, and executive. 
It is best to think of the instructions in these classes because 
it will be easier to define, implement, and test the instructions 
necessary to complete the tasks imposed on the product by its 
work environment. 



Arithmetic 

The following arithmetic functions are required: 

• Add 

• Add with Carry 

• Subtract 

• Subtract with Carry 

• Arithmetic Compare 

• Two's Complement (Negate) 

• Increment 

• Decrement 



Logic 

The following logical functions are required: 

• And 

• Inclusive-Or 

• Exclusive-Or 

• Logical Compare (NOR) 

• Complement 



Shift 

The following shift functions are required: 

• Shift Register Left 

• Shift Register Right 

• Shift Register Left with Carry 

• Shift Register Right with Carry 

• Rotate Register Left 

• Rotate Register Right 

• Rotate Register Left with Carry 

• Rotate Register Right with Carry 



Data Movement 

The following data movement instructions are required: 

• Load Register from Memory 

• Store Register in Memory 

• Transfer from Register-to-Register 

Program Control 

The following program control instructions are required: 

• Jump 

• Jump-to-Subroutine 

• Return 

Input-Output 

The following input-output instructions are required: 

• Read Data into Register 

• Read Status into Register 

• Write Data from Register 

• Write Status from Register 



Executive 

The executive instructions are difficult to define in a prelimi- 
nary specification because they relate to the processor design, 
not to the tasks the processor must accomplish. All of the 
machine instructions necessary to load interrupt vectors, direct 
memory addresses and word counts, and interrupt masks and 
pointers must be included in this set of instructions, as well as 
enable and disable interrupt and DMA commands, etc. 

ADDRESSING METHODS 

The program control and data addressing techniques available 
in the machine level programs will depend upon the instruction 
class that is used. 

In the register-to-register instruction, RR, Figure 1a, the data 
source and destination are both register addresses; Ri and R2- 
Very similar to the RR instruction is the special-function, SF, 
Figure 1b. The data to be tested, manipulated, compared, or 
otherwise operated on is selected by the operand field R-|. The 
second field, F, is used to specify a particular function. 

In the case of the register-indexed instruction, RX, Figure 1c, 
the first operand is specified by field Ri and the second 
operand is specified by the sum of the address field, word 2, 
and the contents of the general purpose register selected for 
use as an index register by the field X. The zero-th register 
may not be used as an index register. If zero is specified in the 
X field the effective address of the second operand is the 
address field contents of the second instruction word. 

The register-immediate instruction, RI, Figure 1d, is very 
similar to the RX instruction except the second operand data 
is already contained in the A-field (the second instruction 
word). As in the RX instruction, the A-field of the RI instruc- 
tion will be added to the contents of the register specified by 
the X-field, but, instead of forming an effective address for the 
second operand, the result of the summation is literal data that 
will be used immediately as the second operand. Again, if X 
is zero, the second word is not modified prior to use. 

Next instruction addressing (program control) will be accom- 
plished with a program counter, PC, a push-pop stack, STACK, 
and a set of program modification instructions. The program 



modification instructions jump, jump-to-subroutine, and 
return-from-subroutine will all affect the next instruction 
address by controlling either the PC or the STACK. 

INPUT-OUTPUT 

Peripheral controllers will be self-contained, high-speed, parallel 
transfer devices that will be capable of both programmed and 
automatic operation. All hardware necessary'to control the 
peripheral device, request CPU service and handle the transfer 
of data will be contained in the controller. 

Programmed I/O 

The peripheral controller will be capable of handling two 
output instructions: Output Data and Output Command. The 
device address will be impressed on the eight lower-order 
address bus lines and the data or command word will be im- 
posed on the system data bus. 

The processor will expect a ready signal from the peripheral 
controller before it will execute a programmed I/O instruction. 
If the I/O port is not ready when the instruction execution is 
attempted, it will go into a wait state until the device ready 
signal can be synchronized with the Computer Control Unit. 

Automatic I/O 

Automatic I/O will consist of three different types of input- 
output service requests: vectored interrupt, cycle stealing 
direct memory access, and CPU freeze direct memory access. 
There will be 16-levels of priority encoded service request, the 
necessary contention logic, and 16-levels of request acknow- 
edged circuitry. All service requests will be processed through 
the Interrupt Control Unit. 

An interrupt request will be serviced after the current instruc- 
tion being executed is finished, and before the next instruction 
is fetched. The PC will be pushed onto the stack and a vector 
address in main memory, previously assigned to a particular 
level of interrupt, will be loaded into the PC. Execution of a 
return from interrupt instruction will cause the stack to be 
popped onto the PC and normal processing will resume. 



Cycle stealing Direct Memory Access will not have to wait for 
an instruction to be concluded to be given permission to steal 
a memory cycle. If the memory is not being used during a 
given cycle the DMA controller is free to attempt an access 
during which time interrupts may not be serviced. This mode 
of operation only permits the reading or writing of a single 
word, and then the DMA controller must relinquish system 
control. 

Block transfer Direct Memory Access will wait for service in 
the same manner that an Interrupt Request waits. When the 
current instruction being executed is complete, the DMA 
controller will usurp control of the data bus and peripheral 
processor until the entire DMA sequence is satisfied. A block 
transfer DMA operation may not be interrupted. 

MAIN MEMORY 

Main memory will consist of static RAM's organized in 4096 
words by 16-bits. Up to 16 of these memory blocks may be 
directly addressed by any instruction or control unit, without 
paging, for a total addressing range of 64K words. 

The memory will be built from NMOS static RAM's using a 
single power supply of +5Vdc and providing an access time of 
200ns or less and a total cycle time of 400ns or less. The 
memory system must be capable of providing a READY flag 
in response to a read or write request. 

The maximum load presented to any data bus or address bus 
by any single memory board will be 0.4mA or less. A low- 
capacitance, low-current bus will be maintained. 



CONTROL PANEL 

A system control panel will be provided to aid both the logic 
designer and the programmer in implementing and debugging 
their tasks. As well as examining or writing main memory and 
monitoring CPU status, the control panel shall be capable of 
loading programs, interrupting or restarting programs, pausing 
and continuing tasks, check-point/restart, etc. 
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a) Register-to- Register Instruction, RR b) Special Function Instruction, SF 
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d) Register Immediate Instruction, Rl 



Figure 1. Instruction Types. 
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CHAPTER III 
ARCHITECTURE 



The computer architecture is depicted in Figure 2. There are 
two major buses in the processor: A 16-bit wide address bus, 
and a 16-bit wide data bus. There is a third bus in the system, 
the control bus, but it will be discussed later with the computer 
control unit. The data and address buses are three-state buses; 
each system element that connects to a bus will present one 
Am25LS, low-power Schottky unit load to the bus. 

Arithmetic Logic Unit 

The Arithmetic Logic Unit, ALU, will be used to perform all 
of the combinatorial arithmetic and logic functions as well as 
the shift and rotate instructions. Arithmetic and logical com- 
parisons, 1's and 2's complement, and incrementing and 
decrementing will be accomplished in the ALU. The ALU 
will also contain 16 general purpose registers and a Q-register 
for use as a scratch pad or for long shift sequences. The ALU 
may act as either a source or a destination for data on the 
data bus. 



Figure 2. Computer Architecture. 



Program Control Unit 

The Program Control Unit, PCU, consists of three important 
elements: the Program Counter, PC, the Memory Address 
Register, MAR, and the Subroutine Stack. When the system's 
primary power is turned on, the PC is cleared to 0. From that 
time on the PC will point to (contain the address of) the next 
instruction to be fetched and executed. The MAR is used to 
hold the second operand of an instruction or the effective 
address of the second operand which was calculated in the 
ALU. In either case the MAR is loaded from the data bus. 
Either the PC or the MAR is enabled onto the address bus at 
the start of any memory cycle. 

There are three types of instruction that the PCU will execute 
when it is not merely incrementally stepping its way through 
a program of instructions. They are called program modifica- 
tion instructions and consist of: Jump, Jump-to-Subroutine, 
and Return. 

The Jump instruction is a two word instruction. The first 
word, with the operation code "Jump," is loaded into the 
instruction register and decoded. If the instruction is a register 
indexed, RX, type and the condition code is true, an effective 
jump address will be calculated and the contents of that address 
will be jammed into the PC. If the instruction type is register 
immediate, Rl, the second word of the instruction will be 
added to the index value and directly loaded into the PC. Once 
the jump takes place the PC will continue incrementing its way 
through program storage until another program modification 
instruction is encountered or its largest value has been reached. 
The Jump-to-Subroutine instruction is very similar except that 
the Stack is used. If the condition code in the first word of the 
instruction is true, the contents of the PC are pushed onto the 
Last-In-First-Out, (LIFO) Subroutine Stack. The address of the 
subroutine, having been derived in the same manner as the 
Jump address, is loaded into the PC and processing continues. 
The Return instruction is executed at the end of a subroutine 
sequence and causes the last entry into the Stack to be popped 
onto the PC, thereby returning the program control to the 
address where the subroutine was called plus one. 

Memory 

With a total memory size of 64Kx 16 we will look at the me- 
chanical design goal of 12x9 inch printed circuit boards. 'It 
immediately becomes obvious that a single board will hold a 
16K x 16 memory module along with its associated decoders, 
buffers, and transceivers. Modules of different speed memories 
may be mixed in a single system due to an asynchronous 
memory ready signal which is synchronized by the CCU. 

Interrupt Control Unit 

The ICU is capable of receiving 16 levels of interrupt request 
from peripheral controllers. Each of those interrupt request 
lines is capable of being masked (enabled or disabled), and 
cleared under program control. Further, a minimum interrupt 
level, or fence, may be implemented with an instruction. The 
highest level interrupt will win the contention race and will be 
presented in the form of 4-bit interrupt address vector. 



INSTRUCTION 
REGISTER 

COMPUTER 
CONTROL 
UNIT 



MICROPROGRAM 
REGISTER 



GENERAL PURPOSE 
REGISTERS 



ARITHMETIC/LOGIC 
UNIT 



SUBROUTINE 
STACK 



PROGRAM COUNTER 



CONDITION CODE 
AND 
SYNC. LOGIC 



INTERRUPT 
CONTROL UNIT 



CONTROL 
PANEL 



J 



> 

5t 



MEMORY ADDRESS 
REGISTER 



WORD COUNTER 



DIRECT MEMORY 
ACCESS CONTROLLER 



BASE ADDRESS 
REGISTER 



\7 




TO INTERFACE 
CONTROLLERS 



The 4-bit vector is presented to a 16 word by 16-bit two port 
RAM which is loaded under program control with interrupt 
vector addresses. The data input port and the "B" output port 
are connected to the data bus. The "A" output port is used to 
detect a DMA attempt. 

When an interrupt request is received, the CCU grants interrupt 
permission at a convenient point in the microinstruction 
sequence. The interrupt acknowledge is transmitted back to 
the requesting peripheral controller and the interrupt vector 
is enabled onto the system data bus. With the interrupt vector 
as the address source the PCU is commanded to do a Jump-to- 
Subroutine. When the interrupt program is finished, a Return- 
from-lnterrupt will cause the interrupt to be cleared and the 
PCU to do a pop. Normal data processing will continue. 

The only invalid interrupt vector address is FFFF 16 , which is 
the highest address the machine is capable of handling. This 
address is reserved for DMA linkages. If the DMA linkage 
condition is detected on the "A" output port of the interrupt 
vector RAM, control is passed to the DMA controller. 

Direct Memory Access 

A single DMA controller channel is planned for this machine 
although any number may be added. For example, if 16 levels 
of DMA were desired, 16 interrupt vector addresses could be 
invalidated, i.e., FFFC^g through FFFF-ig. (The DMA con- 
troller may reside either within the CPU, or on the interface 
circuit board.) 

The DMA consists of two 16-bit binary counters with a 
terminal count detector. One of the counters is called the 



address counter, and its outputs are connected to the address 
bus through a three-state buffer. The other is the word tranfer 
counter. Both counters' parallel inputs are connected to the 
data bus, so that they may be initialized. 
The address counter is initially loaded with the address to, or 
from, which the first data transfer will take place, depending 
upon the direction of the data flow. The requesting peripheral 
controller will be the source of the destination. The word 
counter is initially loaded with the 2's complement of the 
number of words to be transferred. Each time a data transfer 
takes place, both the address and word counters are incre- 
mented simultaneously. When the word counter reaches its 
terminal count the last DMA transfer will be completed. 

Computer Control Unit 

The CCU consists of an instruction register whose input port is 
connected to the data bus and whose Op code output port is 
connected to a starting address PROM. The output of the 
starting address PROM is connected to the direct input of a 
microprogram sequencer, the output of which drives a micro- 
program PROM. The microprogram memory is 256 words deep 
by 40-bits wide which is sufficient for the present machine 
design. A pipeline register is used to hold the current micro- 
instruction to increase the system thru-put. 

The outputs from the Micro Program Register, MPR, control 
each of the architectural blocks. In fact, some of the MPR 
information is fed back to control the CCU. The system clocks, 
condition code and synchronization logic and control panel 
logic are all components of the CCU. 
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CHAPTER IV 

MACHINE INSTRUCTIONS 



This section presents the machine instructions that are to be 
implemented in the processor. They exist in four instruction 
types (Figure 1) and seven instruction classes. The four types 
include register-register, special function, register indexed, and 
register immediate. The seven classes include arithmetic, logic, 
shift, data movement, program control, input-output and 
executive. Not all instructions will be implemented in all four 
types because they either do not make much sense or are 
redundant, or they have little or no value. In the symbolic 
descriptions, the expression on the left of the arrow is the 
location or device to be changed. Parentheses and square 
brackets denote "the contents of"; braces indicate terms 
which are collected prior to use. 

ARITHMETIC 

All of the instructions in this class are executed in the ALU 
portion of the computer. The condition codes available in 
the ALU include carry, "C,' ! the most significant bit, "S," 
overflow, "0," word contents zero, "Z," upper byte zero, 
"U," lower byte zero, "L," A greater than B, "A," and A less 
than B, "B." The condition codes "A" and "B" are only valid 
after a subtract function of the form A minus B. Condition 
codes only change after the execution of ALU instructions. 

ADD 

Add Registers, RR 



Add Register Indexed with Carry, RX 



ADR 



^1 



15 8 7 4 3 

Rl -<Ri) + (R2) 

Add Register Indexed, RX 



ARC 



15 



7 4 



3 



Rl <-<Ri) + (R2> + C 



Add Registers Indexed with Carry, RX 



AXC 


R 1 


X 


I 


15 8 


7 4 


3 


15 




1 


Rl «- (Ri) + [A 


+ (X)]+C 








Add Register Immediate with Carry, Rl 






AIC 




X 


A 


15 8 


7 4 


3 


15 








Rl *~ (Rl) + A+ (X) + C 



SUBTRACT 

Subtract Registers, RR 



SUR 



ARX 


»1 


X 






15 8 


7 4 


3 








15 8 


7 


4 


3 


15 







hi <- mi j - tH2> 










Rl *- (Ri) +[A 


+ 


(X)] 






















Add Register Imrr 


ediate, Rl 








Subtract Register Indexed, RX 








ARI 


R 1 


X 


A 




SRX 


«1 


X 


A 


15 8 


7 4 


3 


15 









15 8 


7 4 


3 


15 




1 



Rl <-<Ri) +A+ (X) 



R1 *- (Ri> -[A+ (X)] 



Add Register to Memory, RX 



ARM 








Subtract Register Immediate, R 


«1 


X 


A 





15 



7 4 



3 



15 



|a+(X)}^(Ri)+[A + (X)] 



Note: For all Rl and RX formats, if X = then (X) = 0. 



SRI 



15 



7 4 



3 



15 



R1^(R1)-{A+(X)} 



Subtract Registers with Borrow, RR 



Increment Memory Indexed, RX 



SRC 


*1 


R 2 




INCM 




jl\ ; 


15 8 


7 4 


3 




15 8 


7 4 


3 1 15 1 



R1 <-(Ri)-(R 2 )-C 

Subtract Registers Indexed with Borrow, RX 



sxc 


»1 


X 


A 


' 15 8 


7 4 


3 


15 ' 



R1 <-(Ri) -[A + (X)]-C 
COMPARE 

Arithmetic Compare Registers, RR 



CAR 



1 1 



15 8 7 4 1 3 1 

CC^(Rl) : (R 2 ) 

Arithmetic Compare Register Indexed, RX 



CC<-(Ri) :[A + (X)] 

Arithmetic Compare Register Immediate, Rl 



CAI 


R 1 


X 


A 



CC«-(Ri) :{A+ (X)| 
COMPLEMENT 

Two's Complement Register, RR 



TWOS 



15 8 7 4 3 

Rl ^(Rl) + 1 

INCREMENT AND DECREMENT 

Increment Register, RR 



INCR 



'15 8 7 4 1 3 1 

R1 <-(-Ri) + 1 

Note: For all Rl ana; RX formats, if X = then (X) = 0. 



{a+(X)}«-[A + (X)] + 1 
Decrement Register, RR 



DECR 



' 15 8" 7 4 ' 3 ' 

R1 «-(Rl)-1 

Decrement Memory Indexed, RX 



DECM 


■■1 


X 


A 


15 8 1 7 4 
Ia+ (X)}<-[A + (X)]- 


3 q 
- 1 


15 ' 



LOGIC 

All of the instructions in the logic class are executed in the 
ALU portion of the computer. The same condition codes that 
were presented with the arithmetic instructions are incumbent 
on the logic instructions. The condition codes only change 
after the execution of an ALU instruction. 



I 

CAX 


«1 


X 


A 


AND 


' 15 8 


7 4 


3 


15 



And Registers, RR 



NRR 


Ri 


R 2 








' 15 8 


7 4 


3 








Rl <-(Ri) .AND. (R 2 ) 








And Registers Indexed, RX 








NRX 


*1 


X 


A 


' 15 8 


7 4 


3 


15 




1 


RT <-(Ri) .AND. [A + (X)] 








And Registers I 


mmediate, Rl 








NRI 


», 


X 


I 


' 15 8 


7 4 


3 


1 15 




1 



Rl <-(Ri) • AND.|a + (X)| 



INCLUSIVE-OR 

Or Registers, RR 



ORR 


*1 


R 2 


15 8 


7 4 


3 


R1 <-(Ri) .OR. (R 2 ) 





10 



Or Register Indexed, RX 



ORX 




X 


A 


15 8 


7 4 


3 


15 







R 1 *-(Ri) .OR. [A + (X)] 








Or Register Immediate, Rl 








ORI 


R 1 


X 


A 


15 8 


7 4 


3 


15 








COMPLEMENT 

Complement Register, RR 



CMPL 



1 1 



R-\ <-(Ri) .OR.{a + (X)| 

EXCLUSIVE-OR 

Exclusive-Or Registers, RR 



15 8 7 4 3 

R1 «-«?) 

SHIFT INSTRUCTIONS 

All of the shift instructions are executed in the ALU portion 
of the computer. Only the carry and zero condition codes may 
be affected by these instructions. 

LOGICAL SHIFT 

Shift Register Left, SF 



SRL 



15 



7 4 3 



XRR 


R 1 


R 2 








15 8 


7 4 


3 








RT <- (Rl) . EXOR . (R2) 








Exclusive-Or Regsiter Indexed, RX 






XRX 


"l 


X 


A 


15 8 


7 4 


3 


15 







Rl <- (Ri) . EXOR . [A + (X)] 








Exclusive-Or Register Immediate, Rl 






XRl 


Rl 


X 


A 


15 8 


7 4 


3 


15 








15 



Each bit in the register selected by Ri is shifted left N places. 
The register is filled with zeroes. 



sr Left with Carry, SF 


SRLC 


Ri 


N 


15 8 


7 4 


3 



r 



15 



Shift Register Right, SF 



Rl <- (Ri) . EXOR .{a + (X)} 


SRR 


R 1 N 




15 8 


7 4 ' 3 



LOGICAL COMPARE 

Logical Compare Registers, RR 



"0" — ~15 



CLR 



15 



R 1 


R 2 


7 4 


3 



Shift Register Right with Carry, SF 



SRRC 



CC^ (Rt) : (R 2 ) 

Logical Compare Register Indexed, RX 



15 



7 4 



3 



CLX 


R 1 


X 


A 


15 8 


7 4 


3 


15 








"0"— j C [ — - 15 



CC<- (Ri) : [A+ (X)] 



ROTATE 

Rotate Register Right, SF 



Logical Compare Register Immediate, Rl 


RRR 


R 1 


N 


CLI 


R 1 


X 


A 




15 8 


7 4 


3 



15 



7 4 3 15 



CC«-(Rl) :{A+ (X)} 





15 












Note: For al! Rl and RX formats, if X = then (X) = 0. 
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Rotate Register Right with Carry, SF 



Store Register Indexed, RX 



mm 


»1 


N 




15 


8 


7 4 


3 




C 




15 












STX 


Ri 


X 


A 



Rotate Register Left, SF 



RRL 



15 





N 


' 7 4 


3 



15 



{ A+ (X)}<- Ri 
PROGRAM CONTROL 

The program control instructions contain a 4-bit condition 
code selection field. The condition code selected must be true 
for the program control instruction to be executed. Eight of 
the condition codes are generated in the ALU and the re- 
maining eight reside in the CCU. One of the condition codes, 
"1111," selects a constant logic level "1" condition providing 
an unconditional instruction. The condition codes will be 
discussed in some detail in the ALU and CCU sections. 

JUMP 

Jump through Register, SF 



Rotate Register Left with Carry, SF 


JTR 


CC 


R 2 




15 8 


7 4 


3 



RRLC 



15 



7 4 



3 












15 







CC = True; PC<-(R2> 
CC = False; PC «- (PC) + 1 

Jump Indexed, RX 



DATA MOVEMENT 

The data movement instructions are used to transfer data from 
memory to a general purpose register, from a general purpose 
register to memory, and from register to register. 



JMPX 



15 



CC 



8 7 4 3 15 



CC = True; PC ^ [A + (X)] 
CC = False; PC «- (PC) + 1 

Jump Immediate, RI 



Load Register, RR 



JMPI 



CC 


X 


7. 4 


3 



LDR 



15 8 7 4 3 

R1«-(R2> 

Load Register Indexed, RX 



1 15 

CC = True; PC «- A + (X) 
CC = False; PC <- (PC) + 1 



JUMP-TO-SUBROUTINE 

Jump-to-Subroutine through Register, SF 



LRX 


R 1 


X 


A 




JSR 


CC 


R 2 


15 8 


7 4 


3 


15 




15 8 


7 4 


3 



R-\ <-[A + (X)] 

Load Register Immediate, RI 



CC = True; Stack <- (PC), PC <- (R 2 ) 
CC= False; PC*- (PC) + 1 



Jump-to-Subroutine Indexed, RX 



LRI 


*1 


X 


A 


1 15 


8 


7 4 


3 


15 








JSX 



15 



CC 


X 


7 4 


3 



R1 ^A + (X) 

Note: For all RI and RX formats, if X = then (X) = 0. 



CC = True; Stack <- (PC), PC <- [A + (X)] 
CC = False;PC^(PC) + 1 



Jump-to-Subroutine Immediate, Rl 



JSI 


CC 


X 


A 


15 


8 


7 4 


3 


15 








CC = True; Stack < (PC), PC *- A + (X) 
CC = False; PC «- (PC) + 1 

RETURN 

Return-from-Subroutine, SF 



CC 


R 2 


7 4 


3 



CC = True; PC «- (Stack) 
CC=False;PC<-(PC) + 1 

INPUT-OUTPUT 

There are two types of input-output instructions: Data and 
Control. When either type of instruction is executed, an input- 
output device address will be loaded into the Memory Address 
Register and presented on the eight least significant address 
lines. Depending on the instruction type, the device address 
will be transferred from a register, come indirectly from a 
memory location, or be calculated from an immediate operand. 
In any case, the upper 8 bits of the address bus are not defined 
within the central processor design, so they may be defined 
and used by the I/O designer. 

DATA 

Read Data into Register, RR 

READ R-| R 2 

Hi 8 ' 7 4 3 iP 

ADDRt *< (R 2 ) 
R1 *- (DB)tt 

^ADDR is the mnemonic for the address bus. 
^DB is the mnemonic for the data bus. 



Read Data into Memory Indexed, RX 



RDMX 


*1 


X 


A 


15 8 


7 4 


3 


15 







ADDR <- (R-|) 
[A + (X)] <- (DB) 










Read Data into Memory Immediate, Rl 






RDMI 


*l 


X 


A 


15 8 


7 4 


3 


15 







ADDR «- (Ri) 
A+ (X) *• (DB) 












Read Data into Register Immediate, Rl 






RORI 


R 1 


X 


A 


15 8 


7 4 


3 


15 








ADDR <- A + (X) 
Rl <~ (DB) 

Note: For all Rl and RX formats, if X = then (X) = 0. 



Write Data from Register, RR 
WRITE R 2 

ADDR *- (R 2 ) 
DB *- (Rf) 

Write Data from Memory Indexed, RX 



WRMX 


R 1 


X 


A 


15 8 


7 4 


3 


15 








ADDR «- (R-i) 
DB^[A+ (X)] 

Write Data Immediate, Rl 



WRMI 


Ri 


X 


A 


15 8 


7 4 


3 


15 







ADDR *- (Ri) 
DB^ A + (X) 












Write Data from Register Immediate, Rl 






WRRI 




X 


A 


15 8 


7 4 


3 


15 








ADDR *- A+ (X) 
DB «- (Ri ) 

CONTROL 

Read Status, RR 

RSTS R, R 2 

"li S 7 ! 4 ' 3 

ADDR <- (R 2 ) 
R1 <-(DB) 

Read Status Immediate, Rl 



RDSI 


Ri 


X 


A 


15 8 


7 4 


3 


15 




1 



ADDR<-A + (X) 
Ri -«-(DB) 

Write Command, RR 



WCMD R, R 2 

15 8 ' 7 4 ' 3 O" 

ADDR <- (R 2 ) 
DB-HRl) 

Write Command Immediate, Rl 



WCMI 


Ri 


X 


A 



15 8 7 4 3 15 

ADDR *- A + (X) 
DB *- (Ri) 



EXECUTIVE 

The executive instructions control the interrupt control unit 
and the direct memory access controller. They will cause ad- 
dresses to be loaded and functions to be turned on or off. This 
set of instructions also provides control information to other 
functional systems in the processor. 

ALU 

Set Carry, RR 



Load Interrupt Service Vector Immediate, Rl 



STC 



R 



15 8 7 4 3 

CARRY *- "1" 

Reset Carry, RR 



ISVI 


SR, 


X 


A 


15 8 


7 4 


3 


15 


SR^A + (X) 








Clear Interrupt Request, RR 




CIR 


R 1 


R 2 




15 8 


7 4 


3 



ICC<- (R2) (Clears interrupt request for each level where R2 
contains a "1".) 



Clear Interrupt Request Indexed, RX 



RTC 


R 1 


R 2 




CIRX 


R 1 


X 


A 


15 8 


7 4 


3 




15 8 


7 4 


3 


15 



CARRY ^"0" 
ICU 

Mask Interrupt Register, RR 



ICC^[A+ (X)] 

Clear Interrupt Request Immediate, Rl 



MIR 



15 8 
MASKR *- (R 2 ) 



7 4 



3 



Mask Interrupt Register Indexed, RX 



ICC-^A + (X) 



Clear Interrupt Vector, RR 



MIX 


"1 


X 


A 




CIV 


"1 


R 2 


15 8 


7 4 


3 


15 




15 8 


7 4 


3 



MASKR <- [A + (X)] 

Mask Interrupt Register Immediate, Rl 



V *- "0" (Clears last vector read.) 
Read Status Register, RR 



Mil 


"1 


X 


A 


15 8 


7 4 


3 


15 







MASKR ^A+ (X) 










_oad Interrupt Service Vector, 


RR 






ISVR 


SR, 


R 2 








15 8 


7 4 


3 



RSR 


*1 


R 2 


15 8 


7 4 


3 


R1 MS) 






Load Status Register, RR 


LSR 


R 1 


R 2 



SR *- (R2) (The address in R2 becomes the vector for the 
interrupt level specified in SR -| .) 

Load Interrupt Service Vector Indexed, RX 



ISVX 


SR, 


X 


A 


15 8 


7 4 


3 


15 








15 8 7 4 3 

S^ (R 2 ) 

Load Status Register Indexed, RX 



SR ^[A + (X)] 

Note: For all Rl and RX formats, if X = then (X) = 0. 



CIRI 


R 1 


X 


A 


15 8 


7 4 


3 


15 








LSRX 


R 1 


X 


A 


15 8 


7 4 


3 


15 








S<-[A + (X)] 



14 



Load Status Register Immediate, Rl 



LSRI 


"1 


X 


I 


' 15 8 

S<-A+ (X) 




' 3 


1 15 







Enable Interrupt System, RR 








EIS 


"1 


R 2 








' 15 8 
EIFF *- "1" 


7 4 


3 








Disable Interrupt System, RR 








DIS 


«1 


R 2 








15 8 
EIFF<-"0" 


7 4 


3 








Clear Interrupt Mask, RR 








CIM 


R 1 


R 2 








15 8 
M <- "0" 


7 4 


3 








Return-from-lnterrupt, RR 








RFI 


Hi 


R 2 








'15 8 1 7 4 1 3 
Pop Stack; Enable Interrupts 








Load Base Address Reg 


ster, RR 






LBAR 


«1 


R 2 








15 8 


7 4 


3 









BAR<-(R 2 ) 



Load Base Address Register Indexed, RX 



LBAX 


"*1 


x 


I 


15 8 1 7 4 
BAR <- [A + (X)] 


3 


15 







Load Base Address Register Immediate, Rl 






LBAI 


»1 


X 


A 


15 8 
BAR <- A + (X) 


7 4 


3 


15 




1 


Load Word Count Register, RR 








LWCR 


"1 


R 2 








15 8 
WCMR 2 ) 


7 4 


3 ' 








Load Word Count Register Indexed, RX 






LWCX 


R 1 


X 


A 


15 8 
WC^[A + (X) 


7 4 

I 


3 


15 







Load Word Count Register Immediate, Rl 






LWCI 


R 1 


X 


A 


' 15 8 


7 4 


3 


15 




1 



WC<-A+ (X) 



Note: For all Rl and RX formats. If X = then 1X1=0. 
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CHAPTER V 

ARITHMETIC LOGIC UNIT 



A block diagram of the arithmetic logic unit is presented in 
Figure 3. All of the arithmetic and logic instructions are 
executed in the ALU, as well as effective address generation 
for the RX and Rl classes of instructions. The ALU is a 16-bit 
parallel subsystem with a high-speed lookahead carry generator. 

At the heart of the ALU subsystem is four 4-bit slice Am2901's, 
providing 16 general purpose registers, each of which is 16 bits 
wide. Also contained in the Am2901 is a parallel combina- 
torial logic array (to be discussed below) with the appropriate 
generate and propagate signals that are provided to the Am2902 
lookahead carry unit. The Am2902 accepts the propagate and 
generate signals from each 4-bit slice and provides the carry-in 
signal to the next stage and an anticipatory signal to successive 
stages. The generate and propagate signals that are output 
from the Am2902 provide information that can be decoded 



for compare functions to show the relative magnitude between 
the two values. (Compare does not work for 2's complement 
numbers.) 

To allow for various shift methods, an external set of multi- 
plexers is provided to introduce long and short shifts and 
rotates and to provide a way to link carry and/or logical "0" 
or "1" into the shift path. 

The signals generated in the Am2901 showing the status of 
the ALU plus the status and condition signals that are derived 
in the ALU are stored into a register each time the ALU is 
used. A multiplexer is used to select one-of-eight test condi- 
tions for presentation to the computer control unit for the 
purpose of providing information on which a decision is made 
to execute a conditional jump. 



RQO- 
RRO - 
CARRY - 
"0" - 



IC2 
1C3 
2C0 
2C1 
2C2 
2C3 



l LRO 



-a 



DB12:15 DB8:11 



RAM 3 



^—5 



X3 



C n+ 2 G 2 f 7 C n+y G l 

Am2902 LOOK-AHEAD CARRY 



'1 c„ +x 



2A. 2B 



OVR - 
ZERO - 
CUB - 
CLB - 
A >B - 
A < B - 



RQO - 
LRO - 
RRO - 
COUT - 



"4 



Figure 3. Arithmetic Logic Unit Block Diagram. 
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Additionally, logic is provided to present the proper signal to 
the carry flip-flop for storage during the next clock cycle. The 
output of the carry flip-flop is gated with a signal Force Carry- 
In which is controlled from microcode. 



THE Am2901 

A detailed block diagram of the bipolar microprogrammable 
microprocessor structure is shown in Figure 4. The circuit is a 
four-bit slice cascadable to any number of bits. Therefore, all 
data paths within the circuit are four bits wide. The two key 
elements in the Figure 4 block diagram are the 16-word by 
4-bit, 2-port RAM and the high-speed ALU. 

Data in any of the 16 words of the Random Access Memory 
(RAM) can be read from the A-port of the RAM as controlled 
by the four-bit A address field input. Likewise, data in any 
of the 16 words of the RAM as defined by the B address field 
input can be simultaneously read from the B-port of the RAM. 
The same code can be applied to the A select field and B select 
field; in which case, the identical file data will appear at both 
the RAM A-port and B-port outputs simultaneously. 

When enabled by the RAM write enable (RAM EN), new data 
is always written into the file (word) defined by the B address 
field of the RAM. The RAM data input field is driven by a 
three-input multiplexer. The configuration is used to shift the 
ALU output data (F), if desired. 

The RAM A-port data outputs and RAM B-port data outputs 
drive separate four-bit latches. These latches hold the RAM 
data while the clock input is LOW. This eliminates any possible 
race conditions that could occur while new data is being 
written into the RAM. 

The high-speed Arithmetic Logic Unit (ALU) can perform 
three binary arithmetic and five logic operations on the two 
4-bit input words R and S. The R input field is driven from 
a 2-input multiplexer, while the S input field is driven from 
a 3-input multiplexer. Both multiplexers also have an inhibit 
capability; that is, no data is passed. This is equivalent to a 
"zero" source operand. 

Referring to Figure 4, the ALU R input multiplexer has the 
RAM A-port and the direct data inputs (D) connected as 
inputs. Likewise, the ALU S input multiplexer has the RAM 
A-port, the RAM B-port and the Q register connected as 
inputs. 

This multiplexer scheme gives the capability of selecting 
various parts of the A, B, D, Q and "0" inputs as source 
operands to the ALU. These five inputs, when taken two at 
a time, result in ten possible combinations of source operand 
pairs. These combinations include AB, AD, AQ, AO, BD, BQ, 
BO, DQ, DO and Q0. It is apparent that AD, AQ and AO are 
somewhat redundant with BD, BQ AND BO in that if the A 
address and B address are the same, the identical function 
results. Thus, there are only seven completely non-redundant 
source operand pairs for the ALU. The Am2901 micropro- 
cessor implements eight of these pairs. The microinstruction 
inputs used to select the ALU source operands are the Irj, h 
and I2 inputs. The definitions of l(), h and I2 for the eight 
source-operand combinations are as shown in Figure 5. Also 
shown is the octal code for each selection. 



The two source operands not fully described as yet 3re the D 
input and Q input. The D input is the four-bit wide direct data 
field input. This port is used to insert all data into the working 
registers inside the device. Likewise, this input can be used in 
the ALU to modify any of the internal data files. The Q 
register is a separate four-bit file intended primarily for multi- 
plication and division routines, but it can also be used as an 
accumulator or holding register for some applications. 

The ALU itself is a high-speed arithmetic/logic operator 
capable of performing three binary arithmetic and five logic 
functions. The I3, I4, and I5 microinstruction inputs are used 
to select the ALU function. The definition of these inputs is 
shown in Figure 6. The octal code is also shown for reference. 
The normal technique for cascading the ALU of several devices 
is in a lookahead carry mode. Carry generate, G, and carry 
propagate, P, are outputs of the device for use with a carry 
lookahead generator such as the Am2902. A carry-out, C n +4, 
is also generated and is available as an output for use as the 
carry flag in a status register. Both carry-in (C n ) and carry-out 
(C n +4) are active HIGH. 

The ALU has three other status-oriented outputs. These are 
F3, F = 0, and overflow (OVR). The F3 output is the most 
significant (sign) bit of the ALU and can be used to determine 
positive or negative results without enabling the three-state 
data outputs. F3 is non-inverted with respect to the sign bit 
output, Y3. The F = output is used for zero detect. It is an 
open-collector output and can be wire OR'd between micro- 
processor slices. F = is HIGH when all F outputs are LOW. 
The overflow output (OVR) is used to flag arithmetic opera- 
tions that exceed the available two's complement number 
range. The overflow output (OVR) is HIGH when overflow 
exists, that is, when C n +3 and C n +4 are not the same polarity. 

The ALU data output is routed to several destinations. It can 
be a data output of the device and it can also be stored in the 
RAM or the Q register. Eight possible combinations of ALU 
dest'nation functions are available as defined by the Ig, I7, and 
Ig microinstruction inputs. These combinations are shown in 
Figure 7. 

The four-bit data output field (Y) features three-state outputs 
and can be directly bus organized. An output control (OE) is 
used to enable the three-state outputs. When OE is HIGH, the 
Y outputs are in the high-impedance state. 

A two-input multiplexer is also used at the data output such 
that either the A-port of the RAM or the ALU outputs (F) 
are selected at the device Y outputs. This selection is controlled 
by the Ig, I7, and l8 microinstruction inputs. Refer to Figure 
4 for the selected output for each microinstruction code 
combination. 

As was discussed previously, the RAM inputs are driven from 
a three-input multiplexer. This allows the ALU outputs to be 
entered non-shifted, shifted up one position (X2) or shifted 
down one position (-r 2). The shifter has two ports; one is 
labeled RAMn. and the other is labeled RAM3. Both of these 
ports consist of a buffer-driver with a three-state output and 
an input to the multiplexer. Thus, in the shift-up mode, the 
RO buffer is enabled and the Rl multiplexer input is enabled. 
Likewise, in the shift down mode, the LO buffer and LI input 
are enabled. In the no-shift mode, both the LO and RO buffers 
are in the high-impedance state, and the multiplexer inputs are 
not selected. This shifter is controlled from the l6, I7, and Is 
microinstruction inputs as defined in Figure 7. Similarly, the 
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Figure 4. Detailed Am2901 Microprocessor Block Diagram. 
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Figure 5. ALU Source Operand Control. 



Q register is driven from a three-input multiplexer. In the 
no-shift mode, the multiplexer enters the ALU data into the 
Q register. In either the shift-up or shift-down mode, the 
multiplexer selects the Q register data appropriately shifted 
up or down. The Q shifter also has two ports; one is labeled 
O-0 and the other is Q3. The operation of these two ports is 
similar to the RAM shifter and is also controlled from l6, I7, 
and Is as shown in Figure 7. 

The clock input to the Am2901 controls the RAM, the Q 
register, and the A and B data latches. When enabled, data 
is clocked into the Q register on the LOW-to-HIGH transition 
of the clock. When the clock input is HIGH, the A and B 
latches are open and will pass whatever data is present at the 
RAM outputs. When the clock input is LOW, the latches are 
closed and will retain the last data entered. If the RAM-EN is 
enabled, new data will be written into the RAM file (word) 
defined by the B address field when the clock input is LOW. 
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Figure 6. ALU Function Control. 



There are eight source operand pairs available to the ALU as 
selected by the In., h, and I2 instruction inputs. The ALU can 
perform eight functions, five logic and three arithmetic. The 
I3, I4, and I5 instruction inputs control the function selection. 
The carry input, C n , also affects the ALU results when in the 
arithmetic mode. The C n input has no affect in the logic mode. 
When In through I5 and C n are viewed together, the matrix of 
Figure 8 results. This matrix fully defines the ALU/source 
operand function for each state. 

The ALU function can also be examined on a "task" basis; 
i.e., add, subtract, AND, OR, etc. In the arithmetic mode, the 
carry will affect the function performed while in the logic 
mode, the carry will have no bearing on the ALU output. 
Figure 9 defines the various logic operations that the Am2901 
can perform, and Figure 10 shows the arithmetic functions of 
the device. Both carry-in LOW (C n = 0) and carry-in HIGH 
(C n = 1) are defined in these operations. 
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Figure 7. ALU Destination Control. 
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Figure 8. Source Operand and ALU Function Matrix. 



DETAILED LOGIC 

Data is input to the ALU froir the data bus via the direct 
inputs, D. The instruction field lg_8 will determine whether 
the destination of the data is one of the 16 general purpose 
registers selected by the register field Rb, the temporary 
register Q, or whether no storage action is to be taken. The 
remainder of the instruction field, lfj-5, will determine which 
two data sources will be selected for operation by the combi- 
natorial arithmetic and logic array and the function to be 
performed by the array. 

Data is output from the ALU through the Y outputs. The Y 
outputs have a high-impedance, three-state condition that is 
controlled by the signal OE*;the signal SALU* which enables 
OE* comes from the CCU and will be discussed with the bus 
control concepts. The source of the output data and any 
operations that are performed on it are controlled by Ra, Rb. 
and lo-8- 

During any operation in the Am2901, the data or operational 
status signals are monitored. The signals OVR, overflow, and 
F15, the sign- bit, are important status bits for arithmetic 
operations and tests. There is a function equal to zero detector 
in the Am290T with an open-collector output. The two zero 
detectors from the least significant byte (Figure 11) F01 and 
F02, are tied together with a pull-up resistor and form the 
signal CLB, compare lower byte. Similarly, the two zero 
detectors from the upper byte, F03 and F04, are tied together 
to form CUB, compare upper byte. In turn, CLB and CUB are 
AND'd together to form the signal ZERO; the result of the 
previous operation was a logical or arithmetic "0" for the 
entire word. 

The generate and propagate signals, G* and P*, originating 
from the most significant end of the Am2902 lookahead carry 



generator are used to provide arithmetic comparisons between 
two words. If the arithmetic function A-B is performed in the 
Am2901, the signal G* may be inverted to derive the condition 
A > B; if the result G equals "1 ," then A is greater than B. By 
taking the logic AND of P* and'G*, the signal A<B may be 
derived; this signal is also positive true. If A is arithmetically 
equivalent to B, the signal ZERO will be true. 

(These tests are for magnitude only comparisons. To compare 
signed, two's complement numbers, the logic would be mod- 
ified to use F-|5 and ZERO.) 

The seven basic or derived signals, F15, OVR, ZERO, CUB, 
CLB, A>B, and A<B are presented to an eight-bit register, 
Am25LS273 or equivalent (Figure 11). This register is loaded 
everytime the ALU is selected for an operation by the CCU; 
the ALU clock, CP1, is generated in the CCU. At the time the 
computer is turned on or a master reset operation is manually 
performed, the signal TOCLR*, turn-on clear, is generated to 
initialize the selected storage devices in the system to an 
appropriate known state. This initialization places the control 
unit in a known state, as well as the various signals upon which 
CCU decisions are made. 

The output of the condition code register is presented to an 
eight-input multiplexer (Figure 11), an Am25LS251, with 
three-state outputs. In addition to the seven signals from the 
condition code register, the condition of the carry flip-flop, 
CARRY, is also provided to the multiplexer. The microcontrol 
signals C3_6 are used to select an ALU condition code for 
conditional branching in microcode or machine code. 

The carry generator and flip-flop logic are shown in Figure 12. 
The input to the carry flip-flop is selected from one-of-six 
sources by an Am25LS151 multiplexer. The flip-flop may be 
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Figure 9. ALU Logic Mode Functions (C n Irrelevant). 



set to "1" or cleared to "0" by selecting either V cc or ground. 
For arithmetic operations, the source of data for the carry 
flip-flop will be the carry-out signal, COUT, from the most 
significant Am2901. For a shifting operation, the most signi- 
ficant bit from the RAM-Register, LRO, may be selected. If 
no change in the carry status is desired in the carry signal itself, 
CAR RY may be selected. Carry signal selection is accomplished 
through the microcode signals Crj-2- The ALU clock, CP1, is 
used to clock-in the new carry data. For some ALU functions, 
it is desirable to force the carry-in signal to "1" to "0." The 
microcode signals FCO and FC1 accomplish this function. The 
signal CIN is the carry-in signal that is presented to the Am2902 
lookahead carry generator and the least significant Am2901. 

The detailed logic for the shift linkages is shown in Figure 13. 
The most and least significant RAM shifter bits are LRO and 
RRO. And the most and least significant Q-Register bits are 
LQO and RQO. 

The potential logic shift linkages are shown diagrammatically 
in Figure 14, while the rotate shift linkages are shown in 
Figure 15. Because there are a large number of instruction/ 
signal combinations to keep track of, the information has been 
tabulated in Figure 16. By diagramming the data flow through 
the registers for each instruction, a table may be compiled by 
instruction type showing the input port and the signal name. 
It can be seen from the Am2901 destination control signals 
I678 thst. whenever there is a shift right that I7 is at logic zero. 
The RAM and Q-LSB multiplexers may therefore be enabled 
by I7* and the RAM and Q-MSB multiplexers may be enabled 
by I7. Because the only instances when the Q-MSB multiplexer 
is needed is for long right shifts or rotates, the select signals 
may be constant as shown. The remaining select bits. So to S5, 
control the remainder of the shift linkage. Further, since 
the MSB and LSB multiplexers are not used simultaneously, 
Sq, and Si may be common to S2 and S3, or S2 and S3 may 
be common to S4 and S5. 

MICROPROGRAM CONTROL OF THE ALU 

A summary of the ALU microinstruction control signals is 
presented in worksheet form in Figure 17. A work sheet of 
this type will prove very helpful during the design and test of 
the ALU as a subsystem. Also, the worksheet provides a simple 
mechanism for algorithm development on a subsystem level. 
Space is provided for line number and the instruction descrip- 
tion, and a column is provided for each control signal that is 
not derived in the ALU but rather is sourced from the CCU or 
another control source. 

The signal TOCLR* is generated automatically when power 
is turned on in the computer, or manually when a master reset 
is generated from the control panel. This signal is used to 
initialize the entire system from a known starting point; all 
SSI and MSI storage elements are directly cleared or preset by 
this signal. 

CP1 is the Am2901 clock. This signal is generated in the com- 
puter control unit whenever the ALU instructions are being 
executed. All storage elaments operate on the LOW-to-HIGH 
transition of CP1. Whenever data is to originate in the ALU 
and be transferred elsewhere, the signal SALU* is true and 
causes the data selected in the ALU to be impressed on the 
data bus (Figure 3). 

The RAM selection fields A and B may originate from the 
Instruction Register or from the microinstruction field. There- 
fore, for most instructions the hardware designer does not 
have to concern himself with the actual contents of the signals 
RAq-3 and RBfj-3, but rather the fact that they are used. The 
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Figure 10. ALU Arithmetic Mode Functions. 
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Figure 11. Detailed Logic Diagram Condition Code Generator, Register and Multiplexer. 




Figure 12. Detailed Logic Diagram Carry Generator and Flip-Flop. 
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Figure 13. Detailed Logic Diagram Shift Linkages. 

22 



LRO 



RRO H — "0" 



rcio - 



Logical Left 
^ — | LRO RRO 



RRO") ^ 



Logical Right 



LRO RRO 



LQO RQO \ — — "0" 

Long Logical Left 



Logical Right w/Carry 



"0" — \ LRO RRO 



fcl— - |"LRO RRO K - "0" /■ 

^— H LOO. RQO | v 

Long Logical Right 

Logical Left w/Carry 



LRO RRO 



b 



LQO 



RQO h — 



■0" 



LRO RRO 



LQO RQO 



Long Logical Left w/Carry Long Logical Right w/Carry 



Figure 14. Logical Shift Linkages. 
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Figure 15. Rotate Shift Linkages. 
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Figure 16. Shift Linkage Instruction, Multiplexer and Control Functions. 
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Figure 17. ALU Microinstruction Control Fields. 
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Figure 18. ALU Micro Instruction Control Examples. 



carry control, condition select, and shift linkage control fields 
were all discussed in detail above. 

Sixteen of the instructions described in Section 5 that affect 
the ALU are presented here (Figure 18) in microcode form. 
It is important to remember that the microcode presented in 
this table is not complete in itself; it is merely that portion of 
the code that controls the execution of the ALU. 

All of these instructions are relatively straight-forward in that 
they only require a single machine cycle except for Add 
Register to Memory, Line 2. Because one of the operands is 
coming from memory and the other operand is coming from a 
general purpose register, the sum of the two operands must be 



temporarily stored in the Q Register. On the next cycle, then, 
a Store Q-Register, Line 15, must be executed to return the 
sum to the addressed memory location. 

Note also the difference between the RR and RX Load Register 
instructions of Lines 12 and 13, and the fact that the ALU 
does not receive a clock pulse, CP1, during the execution of 
the Store Register instructions of Lines 14 and 15. 

If the reader does not have experience in the design of micro- 
controlled processors, it will be useful to rewrite the micro- 
instructions for some of the ALU instructions that were 
presented in Figure 18. Figure 17 will provide a good work- 
sheet. 
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CHAPTER VI 

PROGRAM CONTROL UNIT 



The program control unit is built into the computer to main- 
tain the address of the next instruction to be executed, the 
next instruction operand to be fetched, or to control program 
modification. Program modification consists of controlling 
jumps, jumps-to-subroutines, and returns-from-subroutine; a 
process also called program linkage. 

The address of the next instruction to be executed is main- 
tained in the Program Counter (or Program Counter Register). 
The CCU fetches the first word of the instruction and loads it 
into the Instruction Register which resides in the CCU. At the 
same time that the instruction load occurs, the PC register is 
incremented, thereby pointing to the next instruction or the 
second word of the current instruction. If the address(es) of 
the instruction operand(s) have to be calculated, the calcula- 
tion will be performed in the ALU. The calculated address is 
called an Effective Address, EA, and will be loaded into the 
MAR for the operand fetch. 

A block diagram of this basic PCU is shown in Figure 19 in the 
form it might take with an MSI implementation. Notice that 
all address information is introduced into the PCU from the 
data bus. The PC register is resident in the subroutine stack 



which is 16 levels deep. The RAM used for the subroutine 
stack is controlled by a four-bit up-down counter used as the 
stack pointer. When the primary power is turned on, the stack 
pointer is cleared to zero and the zero-th word in the stack 
becomes the PC register. The output of the RAM is connected 
to the input of a latch which in turn is connected to a full 
adder used as an incrementer. The output of the incrementer 
and the data bus is presented to a 2-input multiplexer, the 
output of which is connected to the data input of the RAM. 
(Although not shown specifically in the diagram, the latch is 
necessary to avoid a race condition.) 

If a Jump instruction is executed, the stack input multiplexer 
selects the data bus as the address source and the address is 
loaded into the PC register. If a Jump-to-Subroutine instruc- 
tion is executed, the PC register is incremented at the same 
clock pulse (this is the return address) and stack pointer is 
incremented (or PUSHed) and the address of the first instruc- 
tion of the subroutine is loaded into the new RAM word which 
is now the PC register for this subroutine level. When a Return- 
from-Subroutine instruction is encountered, the stack pointer 
is decremented (or POPed) and the program execution will 
continue with the instruction after the Jump-to-Subroutine. 
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Figure 19. MSI Program Control Unit Block Diagram. 



Up to 15 levels of subroutine may be implemented with this 
hardware implementation. The MSI PCU described above takes 
approximately thirty 16-pin packages. 

Figure 20 provides an LSI implementation of the PCU described 
above in four 20-pin and one 16-pin package. (The one dif- 
ference between the two systems is that the LSI version is only 
capable of four levels of subroutine.) The LSI PCU is imple- 
mented with four Am2911's and a microcontrol PROM. The 
Am2911 is a four-bit slice microsequencer which is derived 
from the 28-pin Am2909. 



THE Am2909 AND Am2911 

The Am2909 is a bipolar microprogram sequencer intended 
for use in high-speed microprocessor applications. The device 
is a cascadable four-bit slice such that two devices allow 
addressing of up to 256 words of microprogram and three 
devices allow addressing of up to 4K words of microprogram. 
A detailed logic diagram is shown in Figure 21 . 

The device contains a four-input multiplexer that is used to 
select either the address register, direct inputs, microprogram 
counter, or file as the source of the next microinstruction 
address. This multiplexer is controlled by the So and S-| inputs. 

The address register consists of four-D-type, edge-triggered 
flip-flops with a common clock enable. When the address 
register enable is LOW, new data is entered into the register 
on the clock LOW-to-HIGH transition. The address register is 
available at the multiplexer as a source for the next micro- 
instruction address. The direct input is a four-bit field of 
inputs to the multiplexer and can be selected as the next 
microinstruction address. This allows an N-way branch where 
N is any word in the microcode. 
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Figure 20. LSI Program Control Unit Block Diagram. 



The Am2909 contains a microprogram counter (^uPC) that is 
composed of a four-bit incrementer followed by a four-bit 
register. The incrementer has carry-in (C n ) and carry -out 
(C n +4) such that cascading to larger word lengths is straight- 
forward. The /jPC can be used in either of two ways. When the 
least significant carry-in to the incrementer is HIGH, the 
microprogram register is loaded on the next clock cycle with 
the current Y output word plus one (Y+1 -» uPC). Thus, 
sequential microinstructions can be executed. If this least 
significant C n is LOW, the incrementer passes the Y output 
word unmodified and the microprogram register is loaded 
with the same Y word on the next clock cycle (Y -> /uPC). 
Thus, the same microinstruction can be executed any number 
of times by using the least significant C n as the control. 

The last source available at the multiplexer input is the 4x4 
file (stack). The file is used to provide return address linkage 
when executing microsubroutines. The file contains a built-in 
stack pointer (SP) which always points to the last file word 
written. This allows stack reference operation (looping) to be 
performed without a push or pop. 

The stack pointer operates as an up/down counter with sepa- 
rate push/pop and file enable inputs. When the file enable 
input is LOW and the push/pop input is HIGH, the PUSH 
operation is enabled. This causes the stack pointer to incre- 
ment and the file to be written with the required return 
linkage — the next microinstruction address following the 
subroutine jump which initiated the PUSH. 

If the file enable input is LOW and the push/pop control is 
LOW, a POP operation occurs. This implies the usage of the 
return linkage during this cycle and thus a return-from sub- 
routine. The next LOW-to-HIGH clock transition causes the 



stack pointer to decrement. If the file enable is HIGH, no 
action is taken by the stack pointer regardless of any other 
input. 

The stack pointer linkage is such that any combination of 
pushes, pops or stack references can be achieved. One-micro- 
instruction subroutine can be performed. Since the stack is 
four words deep, up to four microsubroutines can be nested. 

The ZERO input is used to force the four outputs to the 
binary zero state. When the ZERO input is LOW, all Y outputs 
are LOW regardless of any other inputs (except OE). Each Y 
output bit also has a separate OR input such that a conditional 
logic one can be forced at each Y output. This allows jumping 
to different microinstructions on programmed conditions. 

The Am2909 features three-state Y outputs. These can be 
particularly useful in military designs requiring external 
Ground Support Equipment (GSE) to provide automatic 
checkout of the microprocessor. The internal control can be 
placed in the high impedance state, and preprogrammed 
sequences of microinstructions can be executed via external 
access to the control ROM/PROM. 

The Am291 1 microprogram sequencer. Figure 22, is a 20-pin 
version of the Am2909. There are two differences between the 
Am2911 and the Am2909: (1) the direct and register inputs 
are separate on the Am2909 and common on the Am2911, 
and (2) there are no OR inputs on the Am2911. Otherwise, 
the two parts are identical. Although intended especially for 
control of microprogram memory, these devices contain much 
of the logic required to control the main memory as well. The 
Am291 1 is used in the PCU described here. 
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Figure 21. Microprogram Sequencer Block Diagram; Am2909. 



DETAILED LOGIC AND MICRO CONTROL 

Almost all of the logic detail is in the Am291 1 itself. The 
remainder of the logic in the PCU lies in the control PROM, 
an Am29751 . There are five address inputs on the PROM: four 
instruction inputs and one test condition input. Although the 
test condition signal, TC, is separated in the discussion of the 
circuit, it is important to understand that TC is in fact a PCU 
instruction bit. TC does not originate in the microcode, but 
rather from the ALU and CCU test condition multiplexers. 
The eight PROM outputs provide all of the control necessary 
for the Am2911's. 

As with the ALU, the PCU clock CP2 is gated by the computer 
control unit. CP2 is withheld from the PCU for those instruc- 
tions that are not to cause address storage or modification. 
There are no other external control signals distributed to the 
PCU. 

The memory address register will be the address or holding 
register in the Am291 1. The PC register will be the micro-PC 
register, and the existing stack will be used as it would be in 
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Figure 22. Am2911 Microsequencer Block Diagram. 



a microsequencer. The data inputs will be connected to the 
data bus and the three-state outputs, Y, will be connected to 
the address bus. Control of the various components within the 
Am2909 and Am291 1 is summarized in Figure 24. 

With the four PCU instruction input lines and the test condi- 
tion line in mind, 16 microinstructions were characterized 
(Figure 23). The clock, microcode and test condition inputs 
are presented on the left and the eight PROM outputs are 
presented on the right. Instruction Op is a clear instruction 
that is used in the power-up sequence to start the machine at 
address OOOOf, while instruction 1p is the normal execution 
instruction. That is, 1p causes the PC register to be incre- 
mented during each clock cycle. 

There are three load instructions: Load MAR, 2p, Load PC, 
3f and Load PC Incremented, 4p. Source MAR, 5f, and 
Source DB, 6p, do not change the storage condition of the 
Am2911's, as the clock pulse is withheld. The internal address 
multiplexer is merely used to select a source for the address 
and to present it to the address bus. These first seven instruc- 
tions are unconditional; the microsequencer control bits are 
the same regardless of the test condition input. 

The Suspend instruction, 7p, is conditional, however, and will 
cause the PC register to stop incrementing and the Y address 
outputs to go to the high impedance condition. The suspend 
instruction is used for DMA access and diagnostic and other 
test conditions. The Hold instruction, 8p, is similar to the 
suspend instruction. The two differences are that it is uncondi- 
tional, and the Y address outputs are active. 

The program modification instructions Jump, Jump-to-Sub- 
routine, and Return, 9 and 10f, 11f. ar| d 12f, respectively, 
are all conditional. If the test fails — the test condition is not 
true — the program continues. There are two Jump instruc- 
tions — 9f selects the MAR as the source of the jump address, 
and 10p selects the data bus as the jump address source. 

The last three instructions, 13p, 14p, and 15p are Loop 
instructions. Begin Loop, 13p, causes the starting address for 
the loop to be pushed onto the stack. The Loop command 
instruction, 15p, will reference the stack for the Loop starting 
address if the test condition is false and cause a jump to that 
location. If the test condition is true, meaning that the loop is 
complete, the command to loop is ignored and an Execute is 
performed as is a Pop stack, thereby clearing the loop reference. 
Another way to exit the loop is the Branch and Pop instruc- 
tion, 14f. If TC is true, a branch is executed; otherwise, the 
instruction is ignored and a normal Execute instruction is 
performed. The loop instructions are presented here for refer- 
ence only. There are no machine instructions that use them. 
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Figure 23. PCU Microinstruction Control PROM. 
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CHAPTER VII 
MEMORY SYSTEM 



The memory system is static and has a 64K-word by 16-bit 
architecture. As the design goal stated, an intention to build 
the processor on 9 x 12 inch printed circuit boards, a memory 
partitioning scheme must be employed because a 64K x 16 
memory will not fit on one card. A 16K-word by 16-bit sub- 
system (Figure 25) will fit nicely, however. The Am9140 
4K x 1 static RAM was chosen for this application. 

Am9130 AND Am9140 

The Am9130 products are high performance, low-power, 
NMOS, 4096-bit, static, read/write random access memories. 
They are implemented as 1024 words by 4 bits per word. The 
data input and output signals are bussed together and share 
common I/O pins. 

The Am9140 products are high performance, low-power, 
4K-bit, static, read/write random access memories. They are 
implemented as 4096 words by 1 bit per word. The data input 
and output signals use separate pins for maximum flexibility. 

All interface signal levels are identical to TTL specifications, 
providing good noise immunity and simplified system design. 
The three-state output will drive two full TTL loads or eight 
low-power Schottky loads for increased fan-out, better capaci- 
tive drive and improved bus interface capability. 

Operational cycles are initiated when the Chip Enable signal 
goes HIGH. When the read or write is complete, Chip Enable 



goes LOW to prepare the memory for the next cycle. Address 
and Chip Select signals are latched on-chip to help simplify 
system timing. Output data is also latched and is available until 
into the next operating cycle. 

Figures 26 and 27 show the block diagrams for the Am9130 
and Am9140. The internal functions are similar, differing only 
in the treatment of I/O lines and the corresponding changes in 
the column circuitry to accommodate the alternate organiza- 
tional structure. 

Chip Enable is the master control input that coordinates all 
internal functions. When CE goes HIGH, a memory operation 
is initiated. When the active operation is complete, CE goes 
LOW to preset the memory for the next cycle. There are no 
maximum time restrictions on either CE polarity. 

Notice that there are ten address lines and four I/O lines for 
the Am9130. The Am9140 requires two more addresses but 
two fewer I/O lines, so the pjn configurations are nearly 
identical. The address inputs are latched into the on-chip 
address register by the rising edge of CE. The row addresses 
propagate through the row decoder and into the storage 
matrix to select one of 64 rows. The column addresses propa- 
gate through the column decoder to the output of the matrix 
to select the appropriate column(s) for input to the sense 
amplifier(s) during a read operation or output from the write 
amplifier(s) during a write operation. 
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Figure 25. Memory Subsystem Block Diagram. 
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Figure 26. Am9130 Block Diagram. 
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Figure 27. Am9140 Block Diagram. 



The Chip Select input is also latched on-chip and acts as the 
high order address input. It controls the input and output 
buffers of the memory and allows several chips to be wired 
together for increased capacity. 

The Am9140 has separate single Data-in and Data-Out lines. 
They may be bussed together externally if desired. The 
Am9130 has four bi-directional Data I/O lines that provide 
output data during read cycles and accept input data during 
write cycles. 

The Read input signal controls the nature of the operation 
being performed by the memory. When it is HIGH, the write 
amplifiers are disabled and only a read can be executed. When 



it is LOW during CE HIGH time, the write amplifiers are 
enabled and incoming data will be written in the addressed 
cell(s). 

Output Enable and Output Disable inputs are used to control 
the mode of the three-state output buffer(s). OE must be 
HIGH and OD must be LOW in order for output data to 
appear. If either OE is LOW or OD is HIGH, the data output(s) 
will be off and present a high impedance interface. 

The Memory Status output is a control signal that describes 
the internal state of the memory. It indicates when output 
data is available and valid; and it shows when the CE transi- 
tions may take place. 



DETAILED LOGIC 

Referring again to Figure 25, the memory is organized in four 
memory banks of 4K x 16. This requires 16 Am9140's per 
memory bank. Each bank will be addressed with address bus 
bits, Arj-11, which are common, and by a chip select bit, 
CSrj-3, which is unique to each bank yet common within the 
bank. The DC address current parameter is minimal with a 
maximum of 10>A per line per device; however, with an input 
capacitance of 9.0pF per device the AC load is more signifi- 
cant. For this reason, a buffer has been used between the 
address bus and the Am9140"s; a good choice for this buffer 
is the Am25LS241. 

The chip select lines are driven by a 2:4 demultiplexer 
(Am25LS139) whose input address is determined by address 
bus lines A12 and A13. The demultiplexer is enabled by 
SELCT*, which is connected to the gate, being true. SELCT* 
is derived from a comparison of the two most significant 
■address bits, A14 and A15, and two switches that reside on 
each memory subsystem printed circuit board. So and S-\. 
When the computer system is assembled, each memory board 
in the system must be given a unique "page" address. As there 
are only four pages allowed in this particular computer, the 
two switches are capable of specifying the page and the two 
address bits are capable of selecting the proper page. The two- 
bit comparator is a pair of open collector Exclusive-NOR gates. 



The memory status signals, MSrj-3, are generated by taking 
the logical AND of all 16 individual MS's in each row, while 
MSrj-3* are derived by taking the logical NOR of the same 
signals. 

The CCU controls the memory system with two signals: 
MEMRQ*, and MEMRD*. MEMRQ*, when true, is a request 
for a memory cycle. MEMRD*, when true, indicates that the 
requested cycle is a read cycle; when false, a write cycle has 
heen requested. Although MEMRQ* initiates a cycle request, 
the Am9140 requires the chip enable signal, CE, to be true to 
start a cycle. CE will be true if a memory cycle request has 
been generated, and the page was selected, SELCT, and none 
of the RAM's is currently completing a memory cycle, MSfj-3* 
are all false. This combinatorial signal STRT*, is loaded into a 
D flip-flop whose Q output is CE. CE is then distributed to the 
chip enable controls and in turn is inverted with an open- 
collector device that is connected to the ready line, RDY, 
making it fa'lse. When memory access has been completed, 
all of the memory status lines, MSrj-3, are true, generating a 
complete signal, CMPLT*. CMPLT* resets the status flip-flop 
making RDY true. 

MEMRD is also gated with SELCT, precluding afaulty attempt 
to write which might result in a loss of data. The derived signal, 
READ, is used to control the direction of the data bus trans- 
ceiver, Am25LS241's, and the Am9140's output enable and 
read/write signals, OE and R. 
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CHAPTER VIII 

INTERRUPT CONTROL UNIT 
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The Am2914, Figure 29, is a high-speed, eight-bit priority 
interrupt unit that is cascadable to handle any number of 
priority interrupt request levels. The device provides a full 
vectored priority output as determined by the highest priority 
interrupt request. 



The key elements of the Am2914 include an eight-bit dual 
rank interrupt register, an eight-bit mask register, and a three- 
bit status register. The device is fabricated using advanced low- 
power Schottky technology and is packaged in a 40-pin 
ceramic DIP. 

The Am2914 can perform 16 instructions (Figure 30), associa- 
ted with interrupt control, vector read, mask register control, 
and status register control. The device can be cascaded in either 
a ripple mode or a parallel mode to provide high-speed priority 
interrupt detection. 

Circuits in the Am2914 include a group enable flip-flop that 
is used as a status pointer. In larger systems, only one group 
enable flip-flop will be LOW at a time. This results in inhibiting 
interrupt requests in all lower priority blocks. The block in 
which this enable flip-flop is LOW can be thought of as the 
decoded binary levels of the more significant status register 
bits. 
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Figure 28. Interrupt Control Unit Block Diagram. 
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The microinstruction set for the Am2914 priority interrupt 
circuit is in Figure 30. In this instruction set, the master clear 
instruction is selected as binary zero such that during a power 
up sequence, the pipeline register in the microprogram control 
unit of the central processor can be cleared (0000). Thus, on 
the next clock cycle, the Am291 4 will execute the master clear 
function. This includes clearing all interrupt registers as well as 
the mask register, status registers, and group enable flip-flop 
except at the least significant block. The least significant group 
enable flip-flop is set LOW by tieing the Group Advance 
Receive input LOW. All other Group Advance Receive inputs 
are tied to Group Advance Send outputs, and these are forced 
HIGH during this instruction. This clear instruction also 
enables the interrupt request flip-flop such that a fully inter- 
rupt driven system can be easily initiated from any interrupt. 

The read vector output instruction is used to read the vector 
value of the highest priority causing the interrupt. The vector 
outputs are three-state drivers that are enabled onto the 
V V 1 V 2 Dus during this instruction. When multiple Am2914's 
are used, the higher order vector bits can be encoded using an 
Am2913 priority encoder. This instruction also causes the 
value "vector plus one" to be automatically loaded into the 
status register. Thus, the status register always points to the 
lowest level at which the user is willing to accept an interrupt. 
AH lower priority levels than that pointed to by the status 
register are, therefore, inhibited. By loading the status register 
with the value "vector plus one," multiple interrupts at the 
same priority level are inhibited. In addition, this instruction 
loads the current vector value into a holding register such that 
this value can be used by the clear interrupt from the vector 
instruction. This allows the user to read the vector associated 
with the interrupt, update the status register to any value 
(possibly much higher than "vector plus one"), and then clear 
the interrupt flip-flop in the interrupt register associated with 
the vector read. 



Am2913 

The Am2913 was designed especially as a support component 
for the Am2914 Vector Priority Interrupt Encoder. Similar to 
the Am9318, Am25LS148, and Am74LS148, the Am2913 is 
an eight-input priority encoder which is used to derive the 
vector for the most significant Am2914 location (Figure 33). 
The vector output, Arj-2, is three-state and controlled by the 
gating function G-]_5. 

The Am2913 is cascadable by connecting the enable output, 
EO*, to the enable input. El*, of the next least significant 
Am2913. 



Am29705 

The Am29704 and Am29705 are 16-word by 4-bit, 2-port 
RAM's built using advanced low-power Schottky processing. 
These RAM's feature two separate output ports such that any 
two 4-bit words can be read from the outputs simultaneously. 
Each output port has a four-bit latch but a common Latch 
Enable (LE) input is used to control all eight latches. The 
device has two Write Enable (WE) inputs and is designed such 
that the Write Enable 1 (WEi) and Latch Enable (LE) inputs 
can be wired together to make the operation of the RAM 
appear edge triggered. 

The device has a fully decoded four-bit A address field to 
address any of the 16 memory words for the A output port. 
Likewise, a four-bit B address input is used to simultaneously 
select any of the 16 words for presentation at the B output 
port. New incoming data is written into the four-bit RAM 
word selected by the B address. The D inputs are used to load 
new data into the device. 
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Figure 29. Am2914 Vectored Priority Interrupt Encoder Block Diagram. 
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Figure 30. Microinstruction Set for Am2914 Priority Interrupt Circuit. 



The Am29704 has open-collector outputs, and the Am29705 
features three-state outputs so that several devices can be 
cascaded to increase the total number of memory words in the 
syst em. T he A output port is in the high impedance state when 
the OE-A input is HIGH. Likewise, the B output port is in the 



high impedance state when the OE-B input is HIGH. Four 
devices can be paralleled using only one Arn25LS139 decoder 
for output control. 

The Write Enable inputs control the writing of new data into 
the RAM. When both Write Enable inputs are LOW, new data 
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Figure 33. Am2913 Function Diagram and Truth Tables. 
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Figure 34. Am29705 Logic Diagram. 
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is written into the word selected by the B address field. When 
either Write Enable input is HIGH, no data is written into the 
RAM. The memory outputs follow the data inputs during 
writing if the latches are open. 

The logic diagram and function tables for the Am29705 are 
shown in Figures 34 and 35, respectively. 



DETAILED LOGIC 

The ICU is one of the most complex subsystems in the com- 
puter because of all the interaction required. The 16 interrupt 
requests, INTRQq,-15*, originate in different input/output 
ports and are presented to the Am2914's for service. In the 
event of an interrupt request, the signal IRQ* is generated and 
transmitted to the CCU for service. When the CCU finishes the 
current instruction and branches to fetch the next instruction, 
the interrupt request causes a branch to the microprogram 
interrupt service routine where the enable interrupt signal 
ENIINT* is generated. The vector of the highest level interrupt 
(INTRQ15* is the highest and INTRQn.* is the lowest) is read 
out and presented to the interrupt acknowledge demultiplexers 
and the interrupt service vector registers. 

The interrupt service vector register file, Am29705, has its 
data input and outputs connected to the data bus. The B port, 
the port which selects the write address, is address controlled 
by the instruction register R-| field (IR7:4), RB. The read/ 
write control, VCTRD, when true, causes a read and when 
false causes a write. BEN* enables the B data port on to the 
data bus. The A port address is the vector address generated by 
the Am2914's. During an interrupt service procedure, the A 
data port output enable, AEN*, is caused to be true, placing 
the main memory interrupt service vector address on the data 
bus. The data bus is monitored by the DMA detector which is, 
in effect, a 16-input NAND gate. If the vector address placed 
on the data bus is a FFFF-|g, the service request is for a DMA 
cycle. If the current interrupt request is not for a DMA cycle, 
an interrupt acknowledge signal, INTAC n *, is generated by a 
4:16 demultiplexer made up of two Am25LS138 3:8 demulti- 



Figure 35. Am29705 Function Tables. 
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Figure 36. Microinstruction Format. 



As there are two 8-input Am2914's generating a 3-bit vector, a 
fourth bit must be generated to resolve the ambiguity between 
the two devices. The Am2913 was employed for this purpose. 
Its gate inputs, Gi_5, are connected to the Am2914 instruc- 
tion lines, I0-3 and IE*, so that the output of the Am2913 
is enabled for the read status command, I = 01 10B, and IE* is 
true. Only one bit of the priority encoder's vector is needed, 
and it is generated by connecting the most significant Am2914 
group status signal, BS* to I1* and the least significant block 
status to lfj*. 

The Am2914's are also connected to the data bus, thereby 
allowing the 16 bits of mask register to be read or loaded, the 
interrupts to be cleared, and the status register to be read or 
loaded. The instruction codes for the Am2914 are presented 
in Figure 36. 
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CHAPTER IX 

DIRECT MEMORY ACCESS 



The DMA controller (Figure 37) in this machine is relatively 
simple in structure and completely controlled by the CCU. 
There are two 16-bit registers in the machine, a base address 
register and a word counter. 

The DMA initialization counter requires that the base address 
register is loaded with the address of the first word to be trans- 
ferred. In the same process, the word counter is loaded with 
the two's complement of the number of data words to be 
transferred. The DMA interrupt service vector, FFFF-|g, is 
loaded into the interrupt service vector register, and the trans- 
fer direction flip-flop is set to the appropriate condition. IO* 
in a true condition, means a write into memory from the 
selected input/output port, while a false condition is a memory 
to input/output device transfer. 

Both registers were built out of Am25LS161 four-bit binary, 
synchronous counters. And the Am25LS2541 non-inverting 
octal buffers were selected for address bus isolation. 



When the first DMA request is generated, the proper read or 
write in main memory occurs. At the end of the memory 
cycle, both the word counter and the base address register 
are incremented. The base address register is then pointing to 
the address of the next main memory transfer, and the number 
of words to be transferred has been decremented by one. This 
process continues until the word counter reaches its maximum 
or terminal count; TERM is true. If terminal count on the base 
address register, DMOVR, becomes true, the memory has 
overflowed. 

Microcontrol of the DMAC consists of two load commands, 
LDWC* and LDBAR*, two increment commands, INCWC and 
INCBA, an output enable control, ENADR*, and a transfer 
direction, DIR. As with the other controlled subsystems, the 
clock, CLK, is gated. 
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Figure 37. Direct Memory Access Controller. 
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CHAPTER X 

THE STATE MACHINE 



Every computer has a predictable sequence of functions that 
occurs depending upon the instruction being executed or the 
automatic hardware functions being performed, i.e., interrupt 
requests, control panel functions, etc. The graph depicting the 
functional flow through the computer sequence is called a 
state transition diagram and is implemented by execution of 
the microprogram. 

The state transition diagram for this computer is shown in 
Figure 38. The sequence starts with a power-up initiation 
function. All of the registers are cleared, the control flip-flops 
are set, and, if the computer has a control panel, the machine 
is probably set to a Halt condition at the end of State 1. 

State 2 tests the Halt request. In its absence, the CCU performs 
a memory read cycle to fetch an instruction. At the end of 
the cycle, the instruction word is loaded into the instruction 
register. 

During State 3, the instruction is decoded, and a determination 
is made as to whether this instruction is one word or two 
words long. If it is a single word register-to-register, RR, or a 
single cycle special function, SF, instruction, the instruction 
is immediately executed and a transfer is made to State 8. If 
a multiple cycle SF instruction has been encountered, the 
same microinstruction is executed until the SF instruction is 
satisfied. If the instruction is a two-word instruction, the CCU 
fetches the operand and stores it in the Q register, then con- 
tinuing to State 4. 

State 4 is used to modify the operand and directly or indirectly 
use it. For RX instructions, a test is made of the X field, 
IRO:3, for a zero value. If X does equal zero, the address of 
the second operand is in Q and must be transferred to the 
MAR. If X is not equal to zero, the contents of the X-th 
general purpose register is added to the contents of the Q 
register and the result is loaded into the MAR. A transition 
is then made to State 6. 

If an immediate instruction is detected in State 4, a test for 
zero is also made on X. When X does not equal zero, the 
contents of the X-th register is added to the contents of the 
Q register, and the result forms the second operand which is 
stored in the Q register. Then transition is made to State 5, 
or, if X equals zero, the transition is immediately made to 
State 5. 

State 5 is the execution state for Rl instructions. The only 
transition from this state is to State 8. 




FETCH DATA EXECUTE 
EXECUTE 

' 

ENABLE INTERRUPT 



Figure 38. Central Processor State Transition Diagram. 

State 6 fetches the second operand for RX instructions from 
the computed address that was previously loaded in the MAR 
during State 4. The next state from State 6 is State 7 where 
the RX instructions are executed. 

State 8 allows the interrupt system to be enabled, and then 
the CCU attempts to branch back to State 2 to fetch the next 
instruction. If there has not been an interrupt, the attempt to 
branch is successful; otherwise, the interrupt request must be 
processed. When the interrupt has been serviced in firmware, 
a branch to State 2 occurs and the interrupt service routine in 
main memory is processed. Execution of a Return from Inter- 
rupt instruction will allow the controller to be returned to its 
pre-interrupt condition and for normal processing to resume. 
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CHAPTER XI 

COMPUTER CONTROL UNIT 



Having defined each of the controlled subsystems in the com- 
puter, and the machine instruction set, the machine instruction 
formats, the "state machine", and the microcontrol bits, the 
design of the computer control unit to drive this processor is a 
relatively straightforward task. 

The architecture of the CCU is presented in'block diagram 
form in Figure 39. The instruction register contains the current 
machine level instruction that is being interpreted and executed 
incrementally by the microprogram. The Op Code portion of 
the instruction is presented to a mapping PROM as an address. 
In turn, the output of the mapping PROM provides the micro- 
sequencer with the starting address of the microprogram that 
satisfies the requirements of the current instruction. 



struction register. Subsequently, the source of the micropro- 
gram PROM address is switched from the mapping PROM to 
the microprogram counter and the process continues. 

The timing, synchronization, and control logic includes the 
power-up initialization hardware, the system clock, the instruc- 
tion decoder for the microsequencer, conditional instruction 
hardware, and the logic that allows processor asynchronous 
events to be synchronized to the CPU. 

Having been given permission to operate from the synchroni- 
zation logic, the bus control and clock distribution hardware 
gates data sources onto the data bus, and clocks data from the 
data bus into controlled destinations. 



Initially, the starting address is passed directly through the 
microsequencer to the address field of the microprogram 
PROM. The first microinstruction appears at the output of the 
PROM some time later and is loaded into the microinstruction 
register (the pipeline register) on the rising edge of the next 
clock pulse. Concomitant with being passed directly to the 
microprogram PROM, an incrementer inside the sequencer 
adds 1 to the starting address and sets up the input to the 
microprogram counter register which is loaded with "address- 
plus-one" by the same clock pulse that loaded the microin- 



INSTRUCTION DECODING 

A detailed logic diagram of the microsubroutine starting 
address hardware is shown in Figure 40. The instruction 
register consists of two sections. First is the eight-bit Op Code 
register that is made from two Am2918 two-input, quad 
registers, and second is the dual operand register comprised 
of two 4-bit registers, Am2918. 
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Figure 39. Computer Control Unit Block Diagram. 
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Figure 40. Microsubroutine Starting Address Block Diagram and Test Condition Multiplexer. 



At the beginning of an instruction sequence, the machine 
instruction is parallel loaded into the instruction register, 
and the Op Code is decoded to provide a starting address as 
described above. And, during the entire execution of the 
instruction, interrupt requests are disabled. But when the 
instruction is completed, the microcontroller executes a micro- 
program that fetches the next instruction and inserts it into 
the instruction register. Between the end of the last instruction 
and the beginning of the instruction fetch, routine machine 
level service requests are enabled. 

Machine level service requests include power-fail interrupts, 
PWRFL*, control panel service requests, PNLR1 *, PNLR2*, 
and PNLR3*, an attempt to DMA read or write, DMARD* or 
DMAWR*, stored program interrupt requests, INTRQ*, and 
the instruction fetch itself, FETCH*. Each of these functions 
has a separate microprogram with a unique starting address. 
Presented in order of priority, these service requests are 
encoded in an Am25LS148, and the three-bit vector is selected 
as the input to the Op Code multiplexer. Op Code/micro- 
interrupt selection is accomplished when FETCH* is true 
causing the priority encoder gate select, GS*, to go low, there- 
by switching the Am2918 output multiplexer. Three bits of 
vector and five bits of "0" are inserted into the Op Code 
multiplexer leaving Op Codes 00-|g through 07-jg unavailable 
for machine instruction assignment. 

Generating machine level service request addresses in this 
manner has two advantages: speed and a smaller microprogram 
word width. While the last microinstruction for the current 
machine instruction is being executed, the fetch signal may be 
enabled, and the first instruction of the most significant service 
routine may be readv to be loaded into the microinstruction 
register by the very next clock pulse. If the proposed tech- 
nique, or one similar to it, is not implemented, a series of test 
and branch instructions must be executed. This technique 



saves microinstruction word width because the PROM field 
would have to be 12 bits wider to provide a branch address 
at the same time that other machine functions were being 
exercised. 

During the instruction fetch and class determination functions, 
it is necessary to detect the value of the X register field for RX 
and Rl instructions for zero. If this 4-bit field is non-zero the 
contents of the specified register is added to the second 
operand to calculate an effective address for the data word to 
be used. Figure 40 allows for the testing of the zero condition 
as well as other signals used for conditional microprogram 
control. 

The register select logic shown in Figure 41 is provided to 
allow for the instruction operand flexibility shown in the 
defined instruction set of Chapter IV. These multiplexers allow 
the source and destination registers for the Am2901 ALU and 
Am29705 ICU/DMA circuits to be selected from either instruc- 
tion register operand field or from one of two fields in micro- 
program memory. Providing the two register select fields in 
microprogram memory allows algorithms to be executed on 
the system. 



MICROPROGRAM SEQUENCING 

The Am2909 microprogram sequencer was described in some 
detail in Chapter VI, and a block diagram is provided here in 
Figure 42 as a reference to be used with Figure 39. Three 
Am2909's are being used to provide an address range of 12 
bits or 4096 words of microprogram storage. All three have 
their equivalent control lines connected together and all 12 
OR inputs are tied to ground, The least significant carry-in 
signal is tied to V C c and all interstage carry-outs and carry-ins 
are appropriately connected. 




Figure 41. Instruction Register/Microprogram 
Register Select Logic. 



R ( Am?903 ONLY) 



LY)V 



PUSH/ POP v FILE ENABLE 



I D AND R 
4 1/ CONNECTED 
A ONAm29l1 
| ONLY 



DIRECT | 
INPUTS 

D > i- 



["Am2909 ONL\H 
! /™3 > 1 



D AR F *jPC 



MULTIPLEXER 



|_l™o > -J" 



ZERO 
> 




MM 



MICROPROGRAM 
COUNTER REGISTER 



NCREMENTER 



Figure 42. Am2909 Microprogram Sequencer Block Diagram. 



The direct inputs are tied to the outputs of the starting address 
PROM's; and the register inputs are tied directly to the outputs 
of the microprogram PROM's, in the branch address field, 
thereby forming part of the microprogram register. All three 
RE*'s are connected to ground causing the address register 
inside of the Am2909 to be loaded on every clock pulse. 

A 32 X 8 PROM, Am29751, Figure 43, is used to control the 
microsequencers. Of the five address inputs, four are used as 
instruction bits and the fifth is used as a qualified (by the 
output, O7) test condition. The other seven outputs are used 
for control of FE*, PUP, SO, SI, and ZERO*, and for genera- 
tion of OPREQ* and HLTRQ*. OPREQ* is used for functions 
thay may have mixed speeds or be asynchronous such as 
memory or I/O reading and writing. HLTRQ* is used to halt 
the processor. The control signals for the microsequencer 
PROM come from an Am25LS161 which is the least significant 
four bits of the microinstruction register. This device was 
chosen for two reasons: during the power-up sequence it may 
be cleared to "0," and with its terminal count gate it will 
detect the presence of all "1's" in the register. The terminal 
count detector generates the FETCH* condition. 

The system clock is one-half of a 74LS124 VCO. Because of 
the speed and need for accuracy in this application, the VCO 
is being crystal-excited by a 5MHz series resonant device. The 
turn-on clear signal, TOCLR*, is generated by an Am555 timer 
using an RC time constant (t = 1.1 RC) and should have a time 
constant of 100msec or greater for most systems. 



A rather powerful microinstruction set is summarized for the 
Am2909 in Figure 44. The inputs to the control PROM are 
shown on the left, and the output control signals are shown on 
the right. Unconditionally occupying the zero-th position is 
the instruction RESET. This instruction causes the address 
out of the Am2909 to go to "0." The second instruction is 
EXECUTE which is also unconditional. EXECUTE selects 
the microprogram counter as the source of address informa- 
tion. EXECUTE is the default condition for most of the 
conditional instructions. 

There are two jump and two jump-to-subroutine instructions 
that reference either the direct inputs or the register inputs. 
There are two return instructions, a classical RETURN for 
in-line programs, and a return with a LOOP default condition 
causing the subroutine to loop until the return condition 
is true. 

There is an unconditional PUSH instruction for initializing a 
loop, a BRANCH and POP instruction that allows an escape 
from a loop without exiting from the bottom. Both condi- 
tional and unconditional loops are provided. An unconditional 
POP is also provided to clear a loop condition or an unwanted 
return address. 

HALT is conditional and causes the CPU to freeze until there 
is a manual intervention or the primary power is cycled off 
and on. EXECUTE ASYNCHRONOUS and EXECUTE and 
FETCH are both unconditional and were discussed above. 
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Figure 43. Timing, Synchronization and Control Logic Schematic. 



DATA BUS CONTROL 

The general form of the bus functional control logic is shown 
in Figure 45. There are three bits of data bus source and three 
bits of data bus destination information is provided. The 
source decoder gates information onto the data bus while 
the destination decoder distributes clock pulses to transfer the 
data from the data bus to a storage device. 

Figure 46 shows an actual set of clock distribution and output 
gate control logic that will satisfy the requirements of the CPU. 
Signals BSRCn_2 control the data bus source and BDSTfj_2 
control the data bus destination. The derived signals are dis- 
tributed to the various subsystems. 



MICROPROGRAM FORMATTING 

The microinstruction register of Figure 39 shows all of the 
control bits necessary to implement all of the control func- 
tions for all of the controlled elements. If summed, the total 



is 76 bits. A microinstruction word that wide would require a 
lot of PROM's, and a lot of registers, and a lot of printed circuit 
board room. This might be acceptable for very high-speed 
processors or even desirable for very flexible machines. For 
almost all applications, however, there is a cost/performance 
trade-off. 

The process of narrowing the microinstruction word is called 
formatting. The first step in formatting is to determine which 
functions are desired or required to go together. Three ex- 
amples of formatting are shown in Figure 47. These formats 
require 48-bit wide microinstruction words. The word width 
may be further shortened to 40 bits if it is deemed undesirable 
to do algorithm control purely from microcode. 

The decision to use formatted microwords or unformatted 
microwords is up to the designer. In an unformatted system, 
the microinstruction will not only be wider than in the for- 
matted words, but a significant number of the bits will not be 
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Figure 44. Am2909 Microprogram Instructions and PROM Pattern. 



used during any given microinstruction. The effect to be 
considered is the fact that there is a lower (usually signifi- 
cantly) ROM efficiency and greater flexibility for unformatted 
microinstructions in comparison to formatted microinstruc- 
tions. Further, formatted microinstruction storage will gener- 
ally be deeper (require a greater address range) than the 
unformatted equivalent whereas the unformatted micro- 
instruction will be wider than the formatted equivalent. The 
use of either technique necessitates the consideration of the 
mechanical layout problems in the system, package count, and 
power requirements. The reason that these trade-offs are 
important can be shown by example: sizing estimates for the 
entire instruction set shown in this document are 160 ROM 
words unformatted versus 210 ROM words formatted, with 
microinstruction widths of 76 bits and 48 bits, respectively. 



Let us consider the industry standard 256 x 4 PROM. Both 
formatted and unformatted versions of the microcode will fit 
into the depth, 256 words, but due to the additional width, 
the unformatted version requires 19 PROM packages versus 
the formatted 12 PROM packages. The formatted version not 
only saves seven 16-pin PROM packages, but five to seven 
square inches of printed circuit board for the seven 16-pin 
pipeline microinstruction registers for a total savings of 10 to 
14 square inches and a total power savings of about 1.1 typical 
to 1 .7 maximum amperes. Clearly, performance needs to be 
a primary factor in the decision process for the final design. 
A reproducible microprogramming form is included here. 
Figure 48, to allow various instructions and techniques to 
be evaluated using as a base the microprogramming examples 
of Figure 18. 




Figure 45. Data Bus Control Logic Technique. Figure 46. Bus Control and Clock Distribution Logic. 
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a) Arithmetic and Program Control Format 
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b) Interrupt, DMA and Control Panel Format 
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c) Microsequence Program Modification Format 



Figure 47. Microprogram Formats. 
47 



I 




